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Group theoretical arguments combined with normal mode analysis techniques are applied to a 
coarse-grained approximation of icosahedral viral capsids which incorporates areas of variable flexi- 
bility. This highlights a remarkable structure of the low-frequency spectrum in this approximation, 
namely the existence of a plateau of 24 near zero-modes with universal group theory content. 
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I. INTRODUCTION AND SUMMARY 

Proteins in a thermal bath exhibit a wide spectrum of 
dynamical behaviours which can be probed, for instance, 
by inelastic neutron diffusion (see [1] for a review). In 
particular, they undergo slow, large-amplitude motions 
which are now widely believed to be instrumental to their 
function. These ideas were first tested in [2, 0, 0, [a, [fl 0| 
where normal mode analysis (NMA) was used to argue 
that only a few low-frequency normal modes of vibra- 
tion are sufficient to describe, with great accuracy, the 
conformational changes in a variety of proteins. The 
method has limitations, as it assumes the existence of 
a single potential well whose minimum is a given stable 
conformation of the protein under study, and therefore 
overlooks the possibility of neighbouring multi-minima 
of energy reported to exist in ^. Qj 2M- Yet, this tech- 
nique yields dynamical data which are consistent with 
experimental results on p roteins, as observed on case- 



and is also confirmed 
Although biologically 



by-case studies in [Tl], [Ij, [T 
by a recent statistical analysis 
significant low-frequency motions are typically not vibra- 
tional due to the damping influence of the protein envi- 
ronment, NMA captures the tendency of the biomolecule 
to change in a few particular directions corresponding 
to low-frequency normal modes, and thus remains a use- 
ful tool when time-dependent methods like molecular dy- 
namics are prohibitive. 

The success of NMA in studying protein dynamics 
has prompted its use in the context of large macro- 
biomolccular assemblies. The main motivation so far 
has been to pin down whether several experimentally ob- 
served conformations of viral particles could be inferred 
from one another by arguing that conformational changes 
occur in directions which maximally overlap with those 
of a few low-frequency putative normal modes of vibra- 
tion of the capsid [1^, [l3, [3 13 ■ The biggest challenge 
remains the choice, within the NMA framework, of a po- 
tential which optimally captures the physics of capsid vi- 
brations whilst taking into account a reduced number of 
degrees of freedom to enable practical calculations. Many 
NMA applied to viruses implement variations of the sim- 
ple Elastic Network Model proposed a decade ago [20| . 
in which the atoms are taken as point masses connected 



by springs modelling interatomic forces, provided the dis- 
tance between them is smaller than a given cutoff param- 
eter. Simplified versions include the restriction to C"- 
atoms only, the approximation in which each residue is 
considered as a point mass, or where even larger domains 
within the constituent coat proteins are treated as rigid 
blocks [l3| • In an effort to optimise the NMA techniques 
when applied to particles with high symmetry, group the- 
oretical considerations have also been exploited [1^, [l^ . 
When combined with an Elastic Network approach, it 
allows for more extensive normal modes calculations of 
viral capsids [l3| and compares well with results obtained 
using an Elastic Network- RTB setup [17]. 

The elastic potential in all analyses above has two ma- 
jor drawbacks: it does not discriminate between strong 
and weak bonds as it depends on a single spring con- 
stant, and it uses the rather crude technique of increasing 
the distance cutoff to resolve capsid instabilities. Con- 
sequently, the frequency spectra have much less struc- 
ture than one would expect in reality, and in particular 
fail to reproduce areas of rigidity and flexibility of the 
capsid satisfactorily. This problem has been addressed 
in [21| where a bond-cutoff method is implemented, to- 
gether with different spring constants for the various 
types of chemical interactions. The proposed model re- 
produces conformational changes better than the conven- 
tional distance-cutoff simulations. 

In this paper, we propose a reductionist approach to 
the study of icosahedral viral capsid vibrations within the 
harmonic approximation. We consider Caspar-Klug viral 
capsids, which are classified according to a triangulation 
number T |22l | . AT = n capsid [28| exhibits 60n coat pro- 
teins, organised in clusters of twelve pentamers located 
at the vertices of an icosahedron, and 10(n — 1) hexam- 
ers at global 3-fold and/or local 6-fold symmetry axes 
of the icosahedral capsid. In our coarse-graining, these 
coat proteins are approximated by point masses located 
at their centres of mass, calculated using the Protein 
Data Base (PDB). We set up an elastic network whose 
representatives are these point masses, with all masses 
normalised to 1. The network topology is determined 
by data from the VIPER website {22^ : two point masses 
are connected by a spring whenever VIPER provides a 
value for the association energy of the two corresponding 



proteins. The spring constants of our model are of the 
form Kmn — PmnK, with Pmn the ratio of the association 
energy of protein pair (m, n) to the largest association 
energy listed in VIPER, and k is a free parameter which 
reflects the lack of confidence in the absolute values of 
association energies published in VIPER. The potential 
reads, 
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where the vector r^ refers to the equilibrium position of 
protein to, and the vector fvi to its position after elastic 
displacement, all vectors originating from the centre of 
the capsid. The resulting force (stiffness) matrix thus 
depends on the parameter k, and possesses more than 
the six zero eigenvalues expected from the rotations and 
translations of the whole capsid whenever the number of 
constraints imposed by the association energies is smaller 
than 180n — 6, where 180n is the total number of degrees 
of freedom for a T = n capsid. 

Our main result, which will be substantiated by 
group theory arguments below, is the existence, in the 
low-frequency spectrum of all our coarse-grained sta- 
ble Caspar-Klug capsids, of 24 near-zero normal modes 
of vibration which always fall in the same set of non- 
singlet irreducible representations of the icosahedral 
group (see ([4])). The first singlet representation, which 
is associated with a fully symmetric mode, always ap- 
pears higher up in the spectrum, in accordance with the 
expectation that such a motion requires more energy to 
develop. The presence of 24 near-zero modes in the spec- 
trum of viral capsids is deeply rooted in the fact that the 
latter exhibit icosahedral symmetry. Although our model 
is too crude to account for a realistic flexibility of the vi- 
ral capsid, and therefore too primitive to get quantitative 
information on the virus function, it can serve as a first 
step in an analysis which should bridge the gap with ex- 
isting molecular dynamics simulations. It discriminates 
between strong and weak inter-protein bonds and there- 
fore captures effects of varying flexibility across the cap- 
sid. Moreover, it is simple enough to highlight universal 
aspects of the low-frequency spectrum of normal modes 
of vibration, inherited from the underlying icosahedral 
symmetry, and to pin down the influence of the elastic 
network design on the instability of the capsid. 

Our results should thus be viewed as a remarkable 
property of the lowest order approximation to virus cap- 
sid vibrations, which may potentially be relevant once 
subsequent orders, with additional interactions and ad- 
ditional degrees of freedom, are taken into account. A 
meaningful comparison with existing virus NMA compu- 
tations in the literature is difficult at this stage, partly 
because of reasons highlighted in [21[ regarding the use of 
Tirion's potential and the distance-cutoff method. How- 
ever, it is noteworthy that the first non-zero mode (a sin- 
glet of J) in the spectrum of IIK97, as calculated in [I^l 
using an all-atom simulation with a cutoff, appears at 



mode 31 and is therefore compatible with our qualitative 
arguments on the number of low- frequency modes [29| . 



II. LOW-FREQUENCY PLATEAU 

The remainder of this paper provides the group the- 
oretical arguments leading to the 24 near zero-mode 
plateau in our coarse-grained capsids. We first approxi- 
mate a Caspar-Klug T = n capsid by merging the 3n pro- 
teins per icosahedral face into a single point mass whose 
equilibrium position is at the centre of the face. These 
point masses are connected by springs (masses and spring 
constants normalised to 1) between nearest neighbours, 
forming a dodecahedral cage with 30 edges, dual to the 
icosahedron. Such a structure is unstable, since the num- 
ber of genuine degrees of freedom is 3 x 20 — 6 = 54, while 
there are only 30 constraints. Accordingly, the cage de- 
velops 24 non-trivial zero modes. To analyse these insta- 
bilities, we compare all possible motions of this dodecahe- 
dral cage consistent with icosahedral symmetry, with the 
dodecahedral motions induced by all possible motions of 
the 12 vertices of an icosahedron. By induced, we mean 
that the dual dodecahedron moves in such a way that 
its vertices are located at the centres of the (deformed) 
icosahedral faces at all times. 

Standard group theoretical methods reviewed in [2J| 
are used to calculate the decomposition of the 36- 
dimensional displacement representation Pj^^q ' of the 
vertices into irreducible representations of the full icosa- 
hedral group Tfi , which contains 60 proper rotations and 
an extra 60 elements obtained by multiplication of the 
latter by the inversion operation. The result is (see 
also (H), 

pdispl,36 -pi 
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= r^+r^+2r'l+r'l+r|+ri+2r^+p^ , (2) 



where the numerical superscripts indicate the dimen- 
sionality of the corresponding irreducible representation, 
while the ± subscripts refer to different irreducible rep- 
resentations of same dimension. If the subgroup X of 60 
proper rotations is used instead, ± representations are 
indistinguishable. The same group theoretical method 
yields, for the motion of the dodecahedral cage, the 60- 
dimensional displacement representation Pqod'e j 

I- rl' + p^ + Pi + 2r^ + Pi 

p;^ + Pi + p^ + Pi . (3) 
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Although Eq.([2]) depends only on icosahedral symmetry 
and not on actual links between icosahedral vertices, it 
is useful to visualise an icosahedral cage formed by point 
masses at its vertices joined by identical springs along 
its 30 edges. In contradistinction with the dodecahedral 
cage, the icosahedral cage would have no non-trivial zero- 
modes, in accordance with the fact that the number of its 
genuine degrees of freedom 3x12 — 6= 30 is equal to the 
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Note that the zero-modes belonging to the T^ representa- 
tion appear in ([3]) but have no counterpart in ^ . Hence 
they must induce no motion when mapped back to an 
icosahedral system, that is, the sum of the displacements 
of any 5 vertices defining a dodecahedral face must be 
zero. This is indeed the case as illustrated in Fig. [51 where 
the icosahedral system may be thought of as constructed 
from the centre-of-mass positions of the dodecahedron 
vertices of each pentagon face. The stabilisation of such 
a capsid requires the introduction of at least 24 further 
springs in a manner that respects the icosahedral symme- 
try. The magnitudes of the spring constants determine 
how much the zero-modes are lifted from zero, and shape 
the structure of the created low-frequency plateau. 
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FIG. 1: Frequencies of normal modes of vibration for (a) 
an icosahedral cage, (h) a dodecahedral cage. The modes 
l> resp. A belong to 3-dimensional irreducible representa- 
tions V%- resp. r± of the icosahedral group. The diamond 
(resp. pentagon) modes belong to 4 (resp. 5)-dimensional ir- 
reducible representations. Small (large) symbols refer to even 
(odd) parity. The x-axis labels the normal modes while the 
y-axis gives the wave numbers up to an overall normalisation. 



number of its constraints. The spectra of the icosahedral 
and dodecahedral cages are depicted in Fig.[lja)&(b). 

As no icosahedral motion leaves the dual dodecahedron 
fixed, the icosahedral motions induce a 36-dimensional 
vector space of (infinitesimal) motions of the dual do- 
decahedron. These include the six global zero-modes 
p3^ _l_ p3^ which are identical in both systems (as well 
as the global dilation vibrational mode y\). AH icosa- 
hedral normal modes of vibration pertaining to an irre- 
ducible representation of Ih in ^iqq ' must be linear 
combinations of dodecahedral normal modes belonging 
to the same irreducible representation. Hence, provided 
that the vibrational modes of the icosahedron have non- 
vanishing components in the vibrational modes of finite 
frequency of the dodecahedron, as is true by inspection, 
the set of representations in ([3]) which are not contained 
in (12) describe the non-trivial zero-modes of the dodeca- 
hedron. It follows that a capsid whose proteins would be 
modelled by 20 point masses located at the centre of each 
face of an icosahedron, and with 30 springs organised in 
a network modelled at equilibrium by a dodecahedron, 
develops 24 zero-modes of vibration which are organised 
in the following irreducible representations of the icosa- 
hedral group. 
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FIG. 2: Top and side views of the normal modes of a dodeca- 
hedral system belonging to the irreducible representation F^ "*" 
of the icosahedral group T. 

Despite its simplicity, the approximation just described 
catches an essential feature of the Caspar-Klug viruses, 
namely the 24-state low-energy plateau with group con- 
tent (|4]). Indeed, consider again a T — n capsid with 
3n proteins per icosahedral face but now treat them, 
in accordance with our introductory discussions, as 3n 
point masses located at the corresponding protein's cen- 
tre of mass calculated from the PDB files. We link them 
by springs according to the association energies listed in 
VIPER [30]. Neglecting first the links between different 
faces, we would expect a number of capsid zero-modes, 
No = (9n - 3A:) • 20 = 60 (3n - k). Here the 9n de- 
grees of freedom within a face are constrained only by 
3A: links, k integer, as a consequence of the global 3-fold 
symmetry of the icosahedron whose axis passes through 
the centre of the icosahedral face. A stable capsid is 
characterised by a force matrix having exactly six zero- 
modes, so that in principle, one needs to introduce at 
least 60(3n — k) — 6 independent constraints (i.e. bonds) 
to stabilise the whole capsid. How these must be chosen 
in a 3-dimensional context is a mathematical question 
which requires further investigation. It is however inter- 
esting to note that the VIPER website gives association 
energies for some inter-face proteins, which correspond 
to '[icosahedron] edge-crossing' bonds. By symmetry, 
these come as multiples of 30 or 60 when considering 
the capsid as a whole. We thus expect that, keeping the 
2(3n—fc) — l edge-crossing bonds per edge with the largest 
association energies, would yield a capsid with 24 non- 
trivial zero modes. In all stable coarse-grained capsids 
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bonds per edge, in accordance with 2(3n — A:) — 1 = 17, 
so that there are 24 non-trivial zero modes. The weak 
bonds serve to hft these zero modes, and the spectrum 
indeed exhibits again a very low-frequency plateau of 24 
modes (see Fig.|4]). A variety of other cap sids with similar 
spectral signatures are presented in [26|. 

IV. CONCLUSION AND DISCUSSION 



FIG. 3: (a) Bond structure of RYMV as given in VIPER, 
(b) Its forty lowest frequency modes showing a plateau of 24 
near-zero modes. Symbols and axes as in figure QJ 
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FIG. 4: (a) Bond structure of HK97 as given in VIPER, (b) 
Its forty lowest frequency modes showing a plateau of 24 near 
zero-modes with the representation content of (Q. Symbols 
and axes as in figure Ql 

we studied 26], this indeed happens, and adding extra 
edge-crossing bonds lifts the 24 zero-modes appearing in 
(|3]) to a low-frequency plateau. Our earlier approxima- 
tion thus simply amounts to replacing 3n — fc by unity in 
the above analysis. 

III. EXAMPLES 

We now illustrate our considerations with two exam- 
ples. The VIPER data for the T = 3 Rice Yellow Mottle 
Virus (PDB: lf2n) is sufficient to stabilise the capsid. It 
is reproduced schematically in Fig. [Altogether with the 40 
lowest-frequency normal modes of vibration. There are 
actually 11 bonds in Fig.[3l but 3 of them are superim- 
posed (2 linking green C-chains, and one linking a C- and 
a B-chain (red)). The total number of bonds is obtained 
from the figure using the global 2-, 3- and 5-fold symme- 
tries of the capsid. The capsids in our examples are not 
invariant under inversion, hence the symmetry group is 
I, and the irreducible representations are all of type '-I-'. 
Discarding the two weakest bonds (which are both edge- 
crossing) leads to k — 4 and 9 edge-crossing bonds per 
edge. In agreement with 2(3n — fc) — 1 = 9 we then ob- 
serve 24 non-trivial zero modes. Restoring the weak long- 
range arms which stretch from the (green) C-chains has 
the crucial effect of lifting the 24 zero modes to produce 
a low-frequency plateau. The T = 7£ Hong-Kong 97 bac- 
teriophage (PDB: 2fte) has a stable capsid too. Keeping 
all but the six weakest bonds (only one of them is edge- 
crossing in Fig. 2]) leads to A: = 12 and 17 edge-crossing 



We have analysed a model of icosahedral virus capsids 
which incorporates areas of varying flexibility. We have 
given a simple mathematical argument which explains 
the appearance of a low-frequency plateau of 24 states 
in the vibrational spectrum of Caspar-Klug viruses. Our 
top-down approach should be viewed as a first step in 
bridging the gap with the few molecular dynamics anal- 
yses published on viral capsids, and our results should 
be viewed as a remarkable property of the lowest order 
approximation to virus capsid vibrations. 

We wish to stress that if the amplitudes of the very 
low frequency modes were known, they could help decide 
whether they trigger instabilities and some of the confor- 
mational changes observed experimentally. Such changes 
play a fundamental role in the function of viruses, and 
a fundamental explanation of their origin is still lacking. 
It is moreover widely believed that the only motions that 
change the capsid surface, and hence influence signifi- 
cantly the interactions of the virus with the environment, 
are those global large amplitude, slow motions. Hence 
the importance of developing models that pin them down 
as accurately as possible. Another application of their 
knowledge is to provide experimenters with systematics 
on how to tune laser pulses in the near-infrared to pro- 
duce damage on viral capsids by forced resonance, an 
innovative technique developed recently [23| . 

We believe that our model can be developed beyond 
the crude approximation we have used in our paper, and 
would then greatly benefit from getting data on the abso- 
lute strength of the bonds, which would provide us with 
quantitative information on low frequency motions. In 
order to test our results, however, improved techniques 
are required as there are, at present, no experimental se- 
tups which could reliably and directly check our frequen- 
cies predictions (as the frequencies we target are too low 
for commonly used techniques). 
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